Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

نویسندگان

Hua Zhang

Yun Tang

Wenju Liu

Bo Xu

چکیده

This paper presents an attempt to introduce unvoiced landmarks into statistical continuous speech recognition system. The unvoiced landmark detection algorithm proposed here locates the points in speech where the vocal folds stop or begin freely vibrating. In our experiments, 87.47% of stops and 98.94% of fricatives are segmented from speech after the unvoiced landmark detection, with a very low insertion error rate of 0.13%. Then these landmarks are incorporated into decoding process of segment model based recognizer as search beginning indicators. The effectiveness of landmark detection algorithm is verified in our landmark-guided recognition system with 240 sentences in 863Test database.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Landmark-Guided Segmental Speech Decoding for Continuous Mandarin Speech Recognition

In this paper, we propose a framework that attempts to incorporate landmarks into a segment-based Mandarin speech recognition system. In this method, landmarks provide boundary information and phonetic class information, and the information is used to direct the decoding process. To prove the validity of this method, two kinds of landmarks that can be reliably detected are used to direct the de...

متن کامل

A novel path extension framework using steady segment detection for Mandarin speech recognition

Frame based decoders are short of using long span of time knowledge while segment based decoders often confuse with complex calculating. This paper proposes a novel decoding framework by integrating steady speech segments information into path extension procedure. Firstly, as baseline decoding system, a dynamic lexicon-tree copy recognizer is developed, which aims to accelerate popular frame ba...

متن کامل

Mandarin Chinese tone nucleus detection with landmarks

This paper discusses a new approach to improve tone recognition by modeling the tone nucleus with vowel landmark detection. The tone nucleus region is identified based on vowel landmark frames derived by an automatic landmark recognition system. In the corresponding tone recognition experiments, the best results with landmark-based tone nucleus regions outperform the best baseline system result...

متن کامل

Robust F0 modeling for Mandarin speech recognition in noise

The F0 contour plays an important role in recognizing spoken tonal languages like Mandarin Chinese. However, the discontinuity of F0 between voiced and unvoiced transition has traditionally been a bottleneck in creating a succinct statistical tone model for automatic speech recognition applications. By applying successfully the Multi-Space Distribution (MSD) to tone modeling, we recently report...

متن کامل

Generation of Fundamental Frequency Contours of Mandarin in HMM-based Speech Synthesis using Generation Process Model

The HMM-based speech synthesis system can produce high quality synthetic speech with flexible modeling of spectral and prosodic parameters. In this approach, short term spectra, fundamental frequency (F0) and duration are generated by multi-stream HMMs separately. However the quality of synthetic speech degrades when feature vectors used in training are noisy. Among all noisy features, pitch tr...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2006

Unvoiced Landmark Detection for Segment-based Mandarin Continuous Speech Recognition

نویسندگان

چکیده

منابع مشابه

Landmark-Guided Segmental Speech Decoding for Continuous Mandarin Speech Recognition

A novel path extension framework using steady segment detection for Mandarin speech recognition

Mandarin Chinese tone nucleus detection with landmarks

Robust F0 modeling for Mandarin speech recognition in noise

Generation of Fundamental Frequency Contours of Mandarin in HMM-based Speech Synthesis using Generation Process Model

عنوان ژورنال:

اشتراک گذاری